Abstract


The objective of this paper is to review the use, in mutagenesis, of various mathematical models to describe the
dose-response relationship and to try to identify thresholds. It is often taken as axiomatic that genotoxic carcinogens could
damage DNA at any level of exposure, leading to a mutation, and that this could ultimately result in tumour development.
This has led to the assumption that, for genotoxic chemicals, there is no discernible threshold. This assumption is
increasingly being challenged in the case of aneugens. The distinction between 'absolute' and 'pragmatic' thresholds is made
and the difficulties in determining 'absolute' thresholds using hypothesis testing approaches are described. The potential of
approaches based upon estimation rather than statistical significance for the characterization of dose-response relationships
is stressed. The achievement of a good fit of a mathematical model to experimental data is not proof that the mechanism
supposedly underlying this model is operating. It has been argued, in the case of genotoxic chemicals, that any effect
produced by a genotoxic chemical which augments the effect producing a background incidence in unexposed individuals will
lead to a dose-response relationship that is non-thresholded and is linear at low doses. The assumptions underlying this
presumption are explored in the context of the increasing knowledge of the mechanistic basis of mutagenicity and
carcinogenicity. The possibility that exposure to low levels of genotoxic chemicals may induce and enhance defence and
repair mechanisms is not easily incorporated into many of the existing mathematical models and should be an objective in
the development of the next generation of biologically based dose-response (BB-DR) models. Studies aimed at detecting or
characterizing non-linearities in the dose-response relationship need appropriate experimental designs with careful attention
to the choice of biomarker, number and selection of dose levels, optimum allocation of experimental units and appropriate
levels of replication within and repetition of experiments. The characterization of dose-response relationships with
appropriate measures of uncertainty can help to identify 'pragmatic' thresholds based upon biologically relevant criteria
which can help in the regulatory process. © 2000 Elsevier Science B.V. All rights reserved.

1. Introduction

The objective of this paper is to review the use, in
mutagenesis, of various mathematical models to describe
the dose-response relationship and to try to
identify thresholds. It has usually been considered
axiomatic that chemicals which damage the genetic
material produce adverse effects which are
non-thresholded. It is assumed that these chemicals can
damage DNA at any level of exposure and that such
damage could result in tumour development. In this
model of carcinogenesis, it is believed that there is a
finite probability that a single molecule of a carcinogen
may be sufficient to damage the genetic material
irreversibly. Such a mutated cell may subsequently
proliferate, eventually leading to a tumour. This
model of no-safe-dose of a carcinogen or, alternatively,
the single molecule theory of carcinogenesis
is closely related to the one-hit models developed for
investigating the low-dose effects of ionizing radiation
[1]. The no-safe-dose of radiation has been a
basic principle of radiation biology as developed by
organisations such as the US National Council on
Radiation Protection (NCRP) [2,3].

It is extremely difficult, if not impossible, to test
the hypothesis of the no-threshold model of carcinogenesis
because it has all the philosophical problems
of trying, in effect, to design an experiment to prove
a negative, i.e., that there is a threshold below which
there are doses of a carcinogen where no adverse
effects occur. Conventional rodent carcinogenicity
bioassays lack the statistical power to resolve whether
a threshold may exist in carcinogenesis, while the
results from larger-scale experiments deliberately
designed to test for effects of carcinogens at low doses
have proved to be equivocal. In many senses, the
debate is unproductive and an example of what
Weinberg [4] calls trans-science: the formulation of a
problem in scientific terms without the ability to
investigate the problem using an experimental approach.
However, the increasing availability of a
wide range of biomarkers provides the opportunity to
define responses at dose levels closer to possible
human exposures.

Thresholds, however, are an accepted principle in
pharmacology for many receptor-activation-dependent
processes such as nerve fibre firing. The argument
for the existence of thresholds for toxicological
processes is based upon homeostatic mechanisms
which allow an organism to compensate for and
adapt to toxicological insults. These mechanisms
include the ability to detoxify a chemical before it
reaches a critical target and to repair damage from
low-level exposure before adverse effects occur. True
threshold dose levels can also be postulated which
have to be exceeded before, e.g., cytotoxicity or the
saturation of detoxification pathways occurs.

2. Mutagenesis and aneuploidy

Mutagenesis covers a range of genetic damage
including gene mutation, chromosomal damage and
changes in chromosomal number. Such damage may
arise either from direct damage to DNA or indirectly
by effects on other targets such as proteins. The
axiom that chemicals which cause genetic damage
have no thresholds is being challenged for aneugens
(chemicals which induce aneuploidy). Aneugens may
act by effects on non-DNA targets, such as the
inhibition of spindle function, with no direct interaction
of the aneugen with the DNA. It has been
argued that these mechanisms have a threshold of
action, in contrast to DNA-reactive mutagens where
covalent binding to DNA occurs and which are
assumed to be non-thresholded.

A postulated mechanistic basis for a threshold for
aneuploidy is binding to a target receptor such as
tubulin. It is argued that a critical number of target
sites must be occupied before the biological effect
occurs, such that there is a degree of redundancy in
this target. This binding to the target receptor is also
usually reversible, such that repeated exposure at low
doses is not additive, resulting in a dose-response
with a threshold [5].

Any carcinogenicity or mutagenicity resulting
from aneuploidy is, thus, considered a secondary
consequence of a toxicological event that has a
threshold, and this should then be incorporated into risk
assessment procedures. Aardema et al. [6], for example,
note that "Aneuploidy induction which does not
involve the direct interaction of a chemical or its
metabolite(s) with DNA is expected to have a threshold.
This must be considered in the risk assessment
for such chemicals: this is not addressed by current
risk assessment guidelines".

Parry et al. [7] proposed that chemicals inducing
aneuploidy probably have thresholds. They suggested
that experiments need to be conducted over a
range of concentrations where there was no aneuploidy
induction and then over concentrations over
which there was dose-related induction of aneuploidy.
Elhajouji et al. [8] argued for the existence of
a threshold for chemically induced aneuploidy in
human lymphocytes using the in vitro micronucleus
test for a number of aneugens. Tinwell and Ashby [9]
considered that they had demonstrated a threshold at
dose levels below 0.01 mg/kg for vincristine in the
in vivo micronucleus test. Haynes et al. [10] were, however,
unable to distinguish between various different
statistical models including or excluding a threshold
based upon their experimental studies.

3. Dose-response models, non-linearity and 
thresholds 

Attempts to identify thresholds usually involve
some experimental investigation of a dose- or
concentration-response relationship. Statistical models of a
dose-response relationship are usually some form
of:

Y = F(d, x_1, x_2, ..., x_n) + E

where Y is the biological response, F describes a
function, d is the administered dose or concentration,
x_1, ..., x_n are covariates such as age, genotype
and sex, and E is the error term or background
noise. Often, such relationships are constrained to be
monotonic such that the response at any dose level is
as high as or higher than that at the zero dose level.
Such dose-response relationships can either include
or exclude a threshold.

A linear equation is one where all the non-constant
terms are first degree. For example: Y = a + bx.
A non-linear equation is one where one or more of
the non-constant terms are more than the first degree.
For example: Y = a + bx^2. Such equations may result in a
curvi-linear relationship but this does not automatically
imply a threshold. Non-linear relationships can
either have a threshold or not. Fig. 1 illustrates two
hypothetical non-linear relationships: one with a
threshold, the other without. A threshold is defined
as a concentration (X_t) below which no effect occurs.

In the non-linear example, this is expressed as:

Y = a (for X = 0 to X_t),
Y = a + b(X - X_t)^2 (for X > X_t).

It should be noted that the term, non-linear, can be
used in a number of different contexts. Edler and
Kopp-Schneider [11], for instance, call a dose relationship
with a threshold and a linear dose-response
relationship (bX) above the threshold non-linear.
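
The difference between a non-linear relationship with and without a threshold can be made concrete with a short sketch. It is illustrative only: the function names and the parameter values for a, b and the threshold dose (written x_t here) are hypothetical, and a quadratic form is assumed for both curves.

```python
def no_threshold_response(x, a, b):
    """Curved relationship Y = a + b*x**2: any dose above zero raises Y."""
    return a + b * x ** 2


def threshold_response(x, a, b, x_t):
    """Y stays at background a up to the threshold x_t, then rises as
    a + b*(x - x_t)**2."""
    if x <= x_t:
        return a
    return a + b * (x - x_t) ** 2


if __name__ == "__main__":
    a, b, x_t = 2.0, 0.5, 3.0  # hypothetical background, slope term, threshold
    for dose in (0.0, 1.0, 3.0, 4.0, 6.0):
        print(dose,
              no_threshold_response(dose, a, b),
              threshold_response(dose, a, b, x_t))
```

Both curves are non-linear, but only the second is flat at background over a range of non-zero doses.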

4. ‘Pragmatic’ vs. ‘absolute’ thresholds 

A threshold model assumes that an effect only
occurs at doses above a certain threshold dose level.
ECETOC defined an 'absolute' threshold as "... a
concentration below which a cell would not 'notice'
the presence of the chemical. In other words, the
chemical is present but does not interact with the
cellular target". The precise identification of such a
threshold, if it exists, is difficult [12]. A 'pragmatic'
threshold can be considered as a concentration below
which any effect is considered biologically unimportant
(Fig. 2) [13]. This term is used in a somewhat
similar way to how ECETOC defines a biological
threshold except that it implies that there may be
effects occurring because of treatment or exposure
but these are below what might be
considered biologically important. An example might be
increases that did not exceed the range of responses
seen in negative control material in a well-conducted
series of experiments. Such a threshold may be
defined, in part, with the help of statistical tests. The
distinction between the various classifications of
thresholds can initiate a philosophical discussion but
is not relevant to regulatory risk assessment.

Fig. 2. Diagram to illustrate the concept of a biologically unimportant
effect. The shaded area includes areas of the relationship
where there is a dose-related increase but the response has not
reached an effect considered of sufficient biological importance:
for instance, has not exceeded the underlying background control
incidence from a series of well-conducted experiments.


The use of safety factors (SFs) in toxicology to
determine levels such as acceptable daily intakes
(ADIs) (as opposed to the mathematically based
low-dose linear extrapolation method) makes the assumption
of a 'pragmatic' threshold based upon the
identification of no or lowest observed (adverse)
effect levels [NO(A)ELs or LO(A)ELs] which are
determined, to some extent, by statistical significance.
The determination of NO(A)ELs from small
experiments cannot, of course, guarantee zero risk at
these levels of exposure. However, the experimental
demonstration of these 'pragmatic' thresholds related
to increased incidences considered to have no biological
importance is easier than for an 'absolute'
threshold for these non-stochastic effects [14].


5. Identification of thresholds by statistical tests: 
hypothesis testing vs. estimation 

Statistical approaches to the identification of 'absolute'
thresholds based upon the use of hypothesis
testing are unsatisfactory. Conventionally, this approach
involves comparing a null hypothesis such as
control = treated group (i.e., a threshold exists)
with an alternative hypothesis of control < treated
group (i.e., no threshold exists). The 'poorer' the
experiment, the harder it is to reject the null hypothesis.
As a consequence, to rule out the possibility
of ever smaller but real increases, the size of the
experiment has to be increased disproportionately.
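
The disproportionate growth in experiment size can be illustrated with a standard normal-approximation sample-size formula for comparing two proportions. This is a minimal sketch, not a method taken from the studies reviewed here: the function name, the one-sided 5% significance level, the 80% power and the example incidences are all assumptions chosen for illustration, and each scored cell is treated as an independent unit.

```python
from math import ceil
from statistics import NormalDist


def cells_per_group(p_control, p_treated, alpha=0.05, power=0.80):
    """Approximate number of independent units per group needed to detect
    an increase from p_control to p_treated with a one-sided test."""
    z_alpha = NormalDist().inv_cdf(1 - alpha)  # one-sided critical value
    z_beta = NormalDist().inv_cdf(power)
    variance = p_control * (1 - p_control) + p_treated * (1 - p_treated)
    return ceil((z_alpha + z_beta) ** 2 * variance
                / (p_treated - p_control) ** 2)


if __name__ == "__main__":
    background = 0.01  # hypothetical control incidence
    for increase in (0.01, 0.005, 0.0025):
        print(increase, cells_per_group(background, background + increase))
```

Under these assumptions, halving the increase to be detected more than doubles the number of units required, and no finite experiment can rule out an arbitrarily small increase.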

The problem is sometimes incorrectly described
as the experimental designs used lacking sufficient
statistical power to detect a threshold. However, the
problem is, rather, that the null hypothesis of no
difference in a comparison between two groups (i.e.,
below the threshold) can only be accepted provisionally.
It cannot, therefore, be said [8] that the "real
biological threshold value for aneuploidy induction
will basically be situated between the last statistically
non-significant concentration and the first statistically
highly significant concentration". Such a
statement might, however, provide an adequate description
of the criteria for a 'pragmatic' threshold.

A failure to detect an increase, defined by a statistically
significant effect, is not proof of the absence of
an effect at a low dosage. Determination of an
increase, even if this does not reach statistical significance,
may also mean that some biological events
are actually happening at low doses which may be
relevant to regulatory arguments relating to no-safe-doses
and linearity at low dose. However, non-significant
increases are often inappropriately referred
to as negative results.

The definition of a 'pragmatic' threshold based
upon a formal statistical test can also be misleading
as the statistical significance depends upon the specific
statistical test, experimental design and hypothesis
being tested. A non-significant increase based
upon a pairwise comparison might be significant
using another test such as, for instance, a trend test.
Statistical methods may, consequently, detect significant
effects as a consequence of non-linearity but not
rule out small but real increases at low doses.

Elhajouji et al. [8,15] provide examples of various
modelling approaches for identifying thresholds for
the induction of aneuploidy by a range of chemicals.
Both papers provide numerous graphical examples of
non-linear dose-response relationships which could
be interpreted as evidence for a threshold. In the first
paper, investigating in vitro micronuclei in human
lymphocytes [8], a discontinuous regression model
for the possible existence of a threshold was used
which reflected a 'jump' in the regression line. A
piecewise linear regression (or breakpoint) 'model'
was fitted to the centromere-positive (MNCen+) vs.
centromere-negative data. The breakpoint used for
the regression was the first statistically significant
increase in MNCen+ (using Fisher's exact test to
compare the micronuclei frequencies and the Chi-square
test for centromere-positive frequencies between
treated and control samples).
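
A breakpoint model of this general kind can be sketched as follows. Note the swap: rather than fixing the breakpoint at the first statistically significant dose, as described above, this sketch chooses it by least squares from a list of candidate doses. The function name, the candidate grid and the data are hypothetical, and a flat background below the breakpoint is assumed.

```python
def fit_breakpoint(doses, responses, candidates):
    """Least-squares fit of Y = a + b*max(0, dose - t), trying each
    candidate breakpoint t and keeping the one with the smallest
    residual sum of squares."""
    n = len(doses)
    y_mean = sum(responses) / n
    best = None
    for t in candidates:
        z = [max(0.0, d - t) for d in doses]  # dose in excess of breakpoint
        z_mean = sum(z) / n
        szz = sum((zi - z_mean) ** 2 for zi in z)
        if szz == 0:
            continue  # no dose above t, so the slope cannot be estimated
        b = sum((zi - z_mean) * (yi - y_mean)
                for zi, yi in zip(z, responses)) / szz
        a = y_mean - b * z_mean
        rss = sum((yi - (a + b * zi)) ** 2 for zi, yi in zip(z, responses))
        if best is None or rss < best["rss"]:
            best = {"threshold": t, "background": a, "slope": b, "rss": rss}
    return best


if __name__ == "__main__":
    doses = [0, 1, 2, 3, 4, 5, 6]
    responses = [2.0, 2.0, 2.0, 2.0, 3.5, 5.0, 6.5]  # hypothetical data
    print(fit_breakpoint(doses, responses, candidates=[0, 1, 2, 3, 4, 5]))
```

A small residual sum of squares at the chosen breakpoint shows only that the model describes the data well; it does not show that a smooth non-threshold curve would fit worse.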

In the second paper [15], using fluorescence in situ
hybridization (FISH) with human lymphocytes, they
defined a threshold in two ways. Firstly, as the point
where a statistically non-significant increase (the last
statistically non-significant concentration, NOEL)
changes to a statistically significant increase (the
first statistically significant concentration, LOEL). A
piecewise linear regression was carried out with the
threshold defined as the first dose showing a significant
increase. Secondly, by estimating inflection
points (the point where there is a change from
concavity to convexity of a curve) using a polynomial
model as an approximate determination of a
threshold.

Various curvi-linear models had previously been
fitted to the data. The polynomial model was chosen
as the best fitting model on the basis of goodness of
fit or r² values and the least number of parameters
needed. Mathematical models were also used to estimate
the number of events ('hits') on the targets
needed for the induction of chromosome loss and
chromosome non-disjunction based upon multi-hit
and multi-target equations [16]. It was noted that
there were differences between estimates of the
thresholds based upon the inflection point derived
from the mathematical model and those estimated
based upon the statistical tests. It is important to note
that the experiments show a marked non-linearity of
the dose-response relationship with a non-significant
slight increase and then a highly significant steep
increase. This seems good evidence for a 'pragmatic',
but not necessarily an 'absolute', threshold.

An alternative approach to hypothesis testing is
based upon the development of confidence intervals
on the dose-response relationship. It may be possible,
using this approach, to define the dose-response
by estimating its size and shape. It would be possible
to say, for instance, that evidence suggests that any
increase, if it exists, is unlikely to exceed a certain
value based upon, say, the upper limit of the 95%
confidence interval (CI). It may thus be possible to
determine a 'pragmatic' threshold representing a
non-linearity.
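
The estimation idea can be sketched by computing an upper confidence limit on the increase over control at a single low dose. This is an illustration under stated assumptions, not a procedure from the reviewed studies: the function name and counts are hypothetical, a normal (Wald) approximation is used, and every scored unit is treated as independent.

```python
from math import sqrt
from statistics import NormalDist


def upper_limit_on_increase(events_control, n_control,
                            events_treated, n_treated, level=0.95):
    """One-sided upper confidence limit for the difference in incidence
    (treated minus control), using a normal approximation."""
    p_c = events_control / n_control
    p_t = events_treated / n_treated
    se = sqrt(p_c * (1 - p_c) / n_control + p_t * (1 - p_t) / n_treated)
    z = NormalDist().inv_cdf(level)
    return (p_t - p_c) + z * se


if __name__ == "__main__":
    # Hypothetical counts: 20/2000 events in control, 24/2000 at a low dose.
    print(upper_limit_on_increase(20, 2000, 24, 2000))
```

The statement then becomes "any increase is unlikely to exceed this limit", which can be compared with whatever increase is judged biologically unimportant, rather than a bare "not significant".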


6. Dose-response models and curve fitting 

The types of models available for fitting dose-response
relationships can be broadly divided into four
categories: empirical curve fits such as polynomials;
semi-empirical models such as those based upon the probit,
logit or Weibull distributions; weak theoretical
models such as the multi-hit or multi-stage models;
and strong theoretical models such as the biologically-based
dose-response (BB-DR) models. Weak
and strong here refer to the quality of the underlying
biological basis of the model.

A multitude of methods exist for empirical
curve-fitting to data sets [17,18]. These include linear,
log-linear, exponential, logit (log(p/(1 - p))),
multiple and logistic regression, polynomials,
splines and non-parametric kernel density regression
[19]. Confidence intervals can be determined by
standard statistical procedures or procedures based
upon re-sampling such as the jackknife, bootstrap
and cross-validation. The STATISTICA software package,
for instance, has a wide range of possible curve
fits available while another package, SigmaPlot Version
3.0, claims to include 100 curve fits.

The majority of models used for curve fitting,
however, have very little underlying biological basis.
Some of the empirical models, such as the Weibull
and probit, are based upon an assumption of an
underlying distribution of tolerances which has a
superficial link to the concept of inter-individual
variability in the thresholds for exposures to chemicals
above which a toxicological event will result.
The weaker theoretical models have the concepts of
'hits' or 'stages' which can be tenuously linked to
biological models of damage to cellular targets or
stages in tumour development. The statistical properties
of many of these models have been recently
reviewed [11].

The choice of curve-fitting model is often based
upon fitting by 'eye' rather than by using any specific
curve-fitting statistical program. Fitting a model
by 'eye' can often guarantee a good fit to the data
but many different 'eye' curves can often be fitted to
the same data. It can then be difficult to distinguish
between alternative models such as, for example,
when a sigmoid curve (a non-threshold model) and a
threshold-type model (Fig. 3) give equally good fits.
As a result, it is not possible to discriminate between
biological models based upon the statistical models.
Consequently, the existence of a relationship chosen
to show a good fit based upon the data does not
prove the existence of a threshold or that the underlying
mechanism postulated by the model reflects the
actual mechanism.

Fig. 3. A diagram showing an example where two different
models (a sigmoid curve with no threshold and a model
including a threshold) can be fitted by 'eye' to give similar
good fits to the same data and showing the difficulty of
distinguishing between alternative statistical models.

Model fitting is usually assumed to be parsimonious
in that the simplest model is chosen in preference
to more complex ones: the principle of Occam's
Razor (that the simplest explanation consistent
with the known facts is chosen) is often invoked by
modellers. It is sometimes argued [11] that a
non-threshold model should be fitted first because it is
the simpler model, having fewer parameters; only if
this model fails to fit the data should the threshold
model then be considered. The choice of model is
based upon some criterion from a goodness of fit test.
These are often measures such as a Chi-square
or r² statistic. Post-hoc justification of a model,
by determining through a statistical test that the fit is
good, introduces a circular argument and does not
prove that the model is necessarily correct.

7. Statistical models vs. biological models for
thresholds: BB-DR models

The statistical models described previously are
primarily data-driven, providing fits to the observed
data but with no or limited biological basis to the
parameters in the model. The stronger theoretical
BB-DR models have a more realistic basis, with the
parameters in the model being mathematical representations
of biological mechanisms. Examples are
the receptor-based models [20]. BB-DR models allow
stochastic effects to be simulated and scenarios
to be developed for extrapolation and prediction.

BB-DR offers the best prospect of providing realistic
mathematical models of the underlying biological
mechanisms. This could allow the development
of hypotheses about, for instance, the action of a chemical
on a biological target, the inclusion of non-linearity
because of metabolic overload, and thresholds because
of the saturation of receptor occupancy, redundancy
in the biological system or repair mechanisms.

BB-DR models are, however, data-intensive, requiring
extensive experimental data to test their assumptions.
The potential certainly exists for further
development of these BB-DR models but the amount
of work required to enable their use for identifying
thresholds should not be underestimated.


8. Linearity at low dose and background incidence

The assumption of low-dose linearity underlies
many mathematical modelling approaches to quantitative
risk assessment of carcinogens. These methods
make a number of assumptions. The dose-response
relationship is assumed to be monotonic with the
same mechanism of action resulting in both the
background and the treated responses under consideration.
Only a fraction of the background response
has to be by this same mechanism for this generality
to hold. If the same mechanism of action can be
shown to be responsible for both control and treated
responses, then it may be reasonable to assume no
threshold. If the mechanism of action is not the
same, then the assumption of no threshold is weaker.
The assumption of low-dose linearity applies just as
much to non-cancer as to cancer endpoints [21].

The linear-at-low-dose effect is a strong assumption.
If the treatment incidence is raised, or in part
raised, because of the same mechanism of action,
then the argument about linearity at low dose 'kicks
in'. In the context of thresholds for aneuploidy, it is
relevant to note that the background level of spontaneous
in vitro non-disjunction in the studies of Elhajouji
et al. [8] was [value illegible]. It may, thus, be difficult
to argue for an 'absolute' threshold in these
studies unless it can be shown that the induced
non-disjunction occurs by a different mechanism in
these studies.

Source: https://www.industrydocuments.ucsf.edu/docs/zjxj0001

The assumption of identical mechanisms of
action made when the concept of low-dose linearity
was originally proposed [22] may, however, now be
too simple. With an increasing appreciation at the
molecular level of the heterogeneity of the causes of
toxicological events in both control and treated
groups [23], it may be that the assumption only holds
if the mechanisms are exactly the same, not just
similar or variants of one another. Information on
whether the underlying mechanisms responsible for
spontaneous and induced aneuploidy are the same
would thus provide some basis for deciding whether
'absolute' thresholds are possible for aneugens.

Two further points are most important and often
understated: firstly, that the linearity only applies
to very low doses (at the limit of the dose tending to
zero) and appreciable non-linearity may apply at
other dose levels; secondly, that the dose-response
relationship at these low-dose levels may have a very
low slope or potency and be very much lower (orders
of magnitude) than that estimated based upon worst-case
models which include data from high (or extremely
high) dose levels.

The assumption of a linear model implies that
there is no level of exposure to a 'high-dose' carcinogen
that is without some degree of risk at
low dose. However, it is possible that low levels of
exposure are, in fact, actually protective [24].
Hormesis implies the existence of a threshold below
which effects are beneficial rather than harmful. This
may result in what are termed J-, U- and β-shaped
curves [13]. Some of the mathematical models used,
such as the Linearized Multistage Model, constrain
the parameters in the model to be ≥ 0 so that it is
not easy to model, for instance, repair processes and
hormesis. Removing this constraint would mean that
if a repair process induced by low level exposure
were to act in the same way as the background repair
mechanism, then the model should predict protective
effects which are also linear at low dose.


9. Experimental design for threshold studies 

The philosophical difficulties in designing experiments
to show 'absolute' thresholds were discussed
earlier. However, the existence of sensitive and specific
endpoints or biomarkers (for example, the use
of selected chromosome probes in FISH) leading to
low variability makes it possible to use a range of
experimental designs for the characterization of any
non-linearity in the dose-response relationship at
low doses.

Two basic options exist: firstly, a hypothesis testing
approach for whether there is a difference between
the response in a negative control and a
treatment; secondly, an estimation approach where
the objective is to define and estimate the dose-response
relationship. The experimental design (the
number and spacing of doses and the allocation of
experimental units between doses) depends upon
the approach chosen and whether the data are qualitative
or quantitative.

In vitro experiments may be able to detect quite
small effects because of the sensitivity of some of
the biomarkers available. However, careful experimental
design with the use of the techniques of
randomization and blocking is required to guard
against artifactual results in the case of such sensitive
endpoints. There is a need, in such studies, for
both replication of individual units within experiments
and repetition of experiments over time or
place. In the case of in vivo studies, the inter-individual
variability in response may be so large that
any small but real effects at low dose will be
swamped by the inherent biological variability of the
system.

An important statistical issue is identifying what
is the independent experimental unit in a study;
whether, for instance, it is the culture or the cell in
an in vitro experiment. For example, a study may
consist of two independent cultures with 1000 cells
scored per culture at each dose level. The choice of
carrying out a statistical analysis based upon either
2000 cells or two cultures makes a large difference
to the statistical power of the experiment. If, however,
the cell is used as the experimental unit, as is
often the case when Chi-square or Fisher's exact
tests are used, and there are appreciable inter-culture
differences, then the statistical significance of a hypothesis
test can be seriously wrong because assumptions
underlying the test have been violated. Both the
Chi-square and Fisher exact tests assume that all the
experimental units, in this case the cells, are independent
of one another. If there are culture differences,
this assumption can be seriously violated.
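
One simple way to check whether the cell can reasonably be treated as the experimental unit is to compare the observed variance between replicate cultures with the variance expected if all cells were independent (binomial sampling). The sketch below is illustrative only: the function name and the counts are hypothetical, equal numbers of cells per culture are assumed, and at least two cultures with a non-zero mean incidence are needed.

```python
from statistics import mean, variance


def culture_dispersion_ratio(events_per_culture, cells_per_culture):
    """Ratio of the observed between-culture variance in incidence to the
    variance expected under binomial sampling of independent cells.
    Values well above 1 suggest real inter-culture differences."""
    proportions = [e / cells_per_culture for e in events_per_culture]
    p_bar = mean(proportions)
    binomial_variance = p_bar * (1 - p_bar) / cells_per_culture
    return variance(proportions) / binomial_variance


if __name__ == "__main__":
    # Hypothetical counts of affected cells in replicate cultures,
    # each with 1000 cells scored.
    print(culture_dispersion_ratio([10, 30], 1000))
    print(culture_dispersion_ratio([19, 21], 1000))
```

With only two cultures per dose the ratio is itself very imprecise, which is one more argument for adequate replication of cultures rather than scoring more cells per culture.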

The characterization of the dose-response relationships
at low dose is an objective of the estimation
approach. In vitro studies need to be, at least,
repeated so that there is a measure of the inter-experimental
variability in any dose-response relationship.
It is important to identify the dose range within which
the threshold is likely to exist or, alternatively,
where the dose-response becomes highly
non-linear. Dose levels should then span a range
from zero (the negative control) to above the
'pragmatic' threshold. An important consideration is
the use of extra replication around the area where the
threshold is believed to occur and around doses
where the accuracy of the dose-response relationship
is important, with the objective of improving the
characterization of these regions. An appropriate
biomarker should be chosen and, ideally, independent
replicate measures obtained at each dose level.
An appropriate dose-response model should be identified
and confidence intervals, such as the 95%
interval, obtained for the relationship to provide
some measure of the uncertainty associated with the
relationship. The dose-response model could be a
statistical model or, if sufficient previous work has
been carried out, a BB-DR model. Power calculations
should be carried out during the experimental
design stage to help develop an optimal allocation of
the resources in the experiment. The sensitivity of
many of the possible biomarkers is such that experimental
designs such as those based upon the factorial
designs could be used to investigate more complex
problems such as synergy and antagonism in the
studies of mixtures.


10. Conclusions 

Attempts are being made to harmonize different
types of risk assessment with arguments made for
the use of mathematical modelling approaches as
opposed to the classic NO(A)EL/SF approach [25].
However, quantitative estimates of carcinogenic risks
from extrapolation using a linear dose-response
model have resulted in much more stringent regulation
of carcinogens than non-carcinogens. Central to
these mathematical approaches has been the underlying
assumption of no threshold and linearity at low
dose. The argument has been made that such assumptions
are general and applicable to non-genotoxic
endpoints just as much as genotoxic endpoints
[21].

'Absolute' thresholds are, however, a philosophical
problem in risk assessment. Statistical tests and
mathematical models cannot prove or disprove their
existence. Experiments to reject the hypothesis of
linearity at low dose are impractical while a significance
testing approach cannot be used to identify
'absolute' thresholds. It is possible, however, to define
biologically unimportant increases and use this
information to determine 'pragmatic' thresholds.

Estimation procedures can put the importance of
thresholds, if any, into perspective. Experiments can
be designed to estimate the dose-response relationships,
providing measures of uncertainty, rather than
for formal hypothesis testing. Ideally, and increasingly
in the future, such dose-response models
should be based upon BB-DR modelling rather than
empirical curve-fitting.